智能论文笔记

Towards Explainability in Modular Autonomous Vehicle Software

Hongrui Zheng , Zirui Zang , Shuo Yang , Rahul Mangharam

分类：机器人

2022-12-01

Safety-critical Autonomous Systems require trustworthy and transparent decision-making process to be deployable in the real world. The advancement of Machine Learning introduces high performance but largely through black-box algorithms. We focus the discussion of explainability specifically with Autonomous Vehicles (AVs). As a safety-critical system, AVs provide the unique opportunity to utilize cutting-edge Machine Learning techniques while requiring transparency in decision making. Interpretability in every action the AV takes becomes crucial in post-hoc analysis where blame assignment might be necessary. In this paper, we provide positioning on how researchers could consider incorporating explainability and interpretability into design and optimization of separate Autonomous Vehicle modules including Perception, Planning, and Control.

translated by 谷歌翻译

Local_INN: Implicit Map Representation and Localization with Invertible Neural Networks

Zirui Zang , Hongrui Zheng , Johannes Betz , Rahul Mangharam

分类：机器人

2022-09-24

机器人定位是使用地图和传感器测量结果找到机器人姿势的反问题。近年来，可逆神经网络（INNS）成功地解决了各个领域的模棱两可的反问题。本文提出了一个解决旅馆本地化问题的框架。我们设计了一个在逆路径中提供隐式映射表示形式的旅馆。通过对评估中的潜在空间进行采样，局部\ _inn输出机器人以协方差构成，可用于估计不确定性。我们表明，本地\ _inn的本地化性能与延迟较低的当前方法相当。我们使用训练集的外观显示了从本地\ _inn的详细的2D和3D地图重建。我们还使用本地\ _inn提供了全球本地化算法来解决绑架问题。

translated by 谷歌翻译

Accelerating Online Reinforcement Learning via Supervisory Safety Systems

Benjamin Evans , Johannes Betz , Hongrui Zheng , Herman A. Engelbrecht , Rahul Mangharam , Hendrik W. Jordaan

分类：机器人

2022-09-22

深度强化学习（DRL）是一种仅从演示和经验中学习机器人控制政策的有前途的方法。为了涵盖机器人的整个动态行为，DRL训练是通常在仿真环境中得出的主动探索过程。尽管这种模拟培训廉价且快速，但将DRL算法应用于现实世界的设置很困难。如果对代理进行训练直到它们在模拟中安全执行，则由于模拟动力学和物理机器人之间的差异引起的SIM到真实差距，将其传输到物理系统很困难。在本文中，我们提出了一种在线培训DRL代理的方法，可以使用基于模型的安全主管在实体车辆上自动驾驶。我们的解决方案使用监督系统检查代理选择的操作是安全还是不安全，并确保在车辆上始终采取安全措施。这样，我们可以在安全，快速，有效地训练DRL算法的同时绕过SIM到现实的问题。我们提供各种现实世界实验，在线培训一辆小型实体车辆，可以自动驾驶，没有事先模拟培训。评估结果表明，我们的方法在未崩溃的同时提高了样品效率的训练代理，并且受过训练的代理比在模拟中训练的代理表现出更好的驾驶性能。

translated by 谷歌翻译

Teaching Autonomous Systems Hands-On: Leveraging Modular Small-Scale Hardware in the Robotics Classroom

Johannes Betz , Hongrui Zheng , Zirui Zang , Florian Sauerbeck , Krzysztof Walas , Velin Dimitrov , Madhur Behl , Rosa Zheng , Joydeep Biswas , Venkat Krovi

分类：机器人

2022-09-21

尽管机器人学课程在高等教育方面已建立，但这些课程通常专注于理论，有时缺乏对开发，部署和将软件应用于真实硬件的技术的系统覆盖。此外，大多数用于机器人教学的硬件平台是针对中学水平的年轻学生的低级玩具。为了解决这一差距，开发了一个自动驾驶汽车硬件平台，称为第1 f1 f1tth，用于教授自动驾驶系统。本文介绍了以“赛车”和替换考试的竞赛为主题的各种教育水平教学模块和软件堆栈。第1辆车提供了一个模块化硬件平台及其相关软件，用于教授自动驾驶算法的基础知识。从基本的反应方法到高级计划算法，教学模块通过使用第1辆车的自动驾驶来增强学生的计算思维。第1辆汽车填补了研究平台和低端玩具车之间的空白，并提供了学习自主系统中主题的动手经验。多年的四所大学为他们的学期本科和研究生课程采用了教学模块。学生反馈用于分析第1个平台的有效性。超过80％的学生强烈同意，硬件平台和模块大大激发了他们的学习，而超过70％的学生强烈同意，硬件增强了他们对学科的理解。调查结果表明，超过80％的学生强烈同意竞争激励他们参加课程。

translated by 谷歌翻译

Differentiable Safe Controller Design through Control Barrier Functions

Shuo Yang , Shaoru Chen , Victor M. Preciado , Rahul Mangharam

分类：机器学习

2022-09-20

基于学习的控制器，例如神经网络（NN）控制器，可以表现出很高的经验性能，但缺乏正式的安全保证。为了解决此问题，已将控制屏障功能（CBF）应用于安全过滤器，以监视和修改基于学习的控制器的输出，以确保闭环系统的安全性。但是，这种修饰可能是近视的，具有不可预测的长期影响。在这项工作中，我们提出了一个安全的NN控制器，该控制器采用了基于CBF的可区分安全层，并研究了基于学习的控制中安全的NN控制器的性能。具体而言，比较了两个控制器的公式：一个是基于投影的，另一个依赖于我们提出的集合理论参数化。两种方法都证明了在数值实验中使用CBF作为单独的安全滤波器的改进的闭环性能。

translated by 谷歌翻译

Game-theoretic Objective Space Planning

Hongrui Zheng , Zhijun Zhuang , Johannes Betz , Rahul Mangharam

分类：机器人 | 人工智能

2022-09-16

自主赛车奖的代理商对反对者的行为做出反应，并以敏捷的操纵向沿着赛道前进，同时惩罚过度侵略性和过度保守的代理商。了解其他代理的意图对于在对抗性多代理环境中部署自主系统至关重要。当前的方法要么过分简化代理的动作空间的离散化，要么无法识别行动的长期影响并成为近视。我们的工作重点是应对这两个挑战。首先，我们提出了一种新颖的降低方法，该方法封装了不同的代理行为，同时保留了代理作用的连续性。其次，我们将两种代理赛车游戏制定为遗憾的最小化问题，并通过遗憾的预测模型为可行的反事实遗憾最小化提供了解决方案。最后，我们在规模的自动驾驶汽车上实验验证了我们的发现。我们证明，使用拟议的游戏理论规划师使用代理表征与客观空间显着提高了对不同对手的获胜率，并且在看不见的环境中，改进可以转移到看不见的对手。

translated by 谷歌翻译

Mapping smallholder cashew plantations to inform sustainable tree crop expansion in Benin

Leikun Yin , Rahul Ghosh , Chenxi Lin , David Hale , Christoph Weigl , James Obarowski , Junxiong Zhou , Jessica Till , Xiaowei Jia , Troy Mao

分类：计算机视觉 | 机器学习

2023-01-01

Cashews are grown by over 3 million smallholders in more than 40 countries worldwide as a principal source of income. As the third largest cashew producer in Africa, Benin has nearly 200,000 smallholder cashew growers contributing 15% of the country's national export earnings. However, a lack of information on where and how cashew trees grow across the country hinders decision-making that could support increased cashew production and poverty alleviation. By leveraging 2.4-m Planet Basemaps and 0.5-m aerial imagery, newly developed deep learning algorithms, and large-scale ground truth datasets, we successfully produced the first national map of cashew in Benin and characterized the expansion of cashew plantations between 2015 and 2021. In particular, we developed a SpatioTemporal Classification with Attention (STCA) model to map the distribution of cashew plantations, which can fully capture texture information from discriminative time steps during a growing season. We further developed a Clustering Augmented Self-supervised Temporal Classification (CASTC) model to distinguish high-density versus low-density cashew plantations by automatic feature extraction and optimized clustering. Results show that the STCA model has an overall accuracy of 80% and the CASTC model achieved an overall accuracy of 77.9%. We found that the cashew area in Benin has doubled from 2015 to 2021 with 60% of new plantation development coming from cropland or fallow land, while encroachment of cashew plantations into protected areas has increased by 70%. Only half of cashew plantations were high-density in 2021, suggesting high potential for intensification. Our study illustrates the power of combining high-resolution remote sensing imagery and state-of-the-art deep learning algorithms to better understand tree crops in the heterogeneous smallholder landscape.

translated by 谷歌翻译

On the Interpretability of Attention Networks

Lakshmi Narayan Pandey , Rahul Vashisht , Harish G. Ramaswamy

分类：机器学习

2022-12-30

Attention mechanisms form a core component of several successful deep learning architectures, and are based on one key idea: ''The output depends only on a small (but unknown) segment of the input.'' In several practical applications like image captioning and language translation, this is mostly true. In trained models with an attention mechanism, the outputs of an intermediate module that encodes the segment of input responsible for the output is often used as a way to peek into the `reasoning` of the network. We make such a notion more precise for a variant of the classification problem that we term selective dependence classification (SDC) when used with attention model architectures. Under such a setting, we demonstrate various error modes where an attention model can be accurate but fail to be interpretable, and show that such models do occur as a result of training. We illustrate various situations that can accentuate and mitigate this behaviour. Finally, we use our objective definition of interpretability for SDC tasks to evaluate a few attention model learning algorithms designed to encourage sparsity and demonstrate that these algorithms help improve interpretability.

translated by 谷歌翻译

Current State of Community-Driven Radiological AI Deployment in Medical Imaging

Vikash Gupta , Barbaros Selnur Erdal , Carolina Ramirez , Ralf Floca , Laurence Jackson , Brad Genereaux , Sidney Bryson , Christopher P Bridge , Jens Kleesiek , Felix Nensa

分类：人工智能

2022-12-29

Artificial Intelligence (AI) has become commonplace to solve routine everyday tasks. Because of the exponential growth in medical imaging data volume and complexity, the workload on radiologists is steadily increasing. We project that the gap between the number of imaging exams and the number of expert radiologist readers required to cover this increase will continue to expand, consequently introducing a demand for AI-based tools that improve the efficiency with which radiologists can comfortably interpret these exams. AI has been shown to improve efficiency in medical-image generation, processing, and interpretation, and a variety of such AI models have been developed across research labs worldwide. However, very few of these, if any, find their way into routine clinical use, a discrepancy that reflects the divide between AI research and successful AI translation. To address the barrier to clinical deployment, we have formed MONAI Consortium, an open-source community which is building standards for AI deployment in healthcare institutions, and developing tools and infrastructure to facilitate their implementation. This report represents several years of weekly discussions and hands-on problem solving experience by groups of industry experts and clinicians in the MONAI Consortium. We identify barriers between AI-model development in research labs and subsequent clinical deployment and propose solutions. Our report provides guidance on processes which take an imaging AI model from development to clinical implementation in a healthcare institution. We discuss various AI integration points in a clinical Radiology workflow. We also present a taxonomy of Radiology AI use-cases. Through this report, we intend to educate the stakeholders in healthcare and AI (AI researchers, radiologists, imaging informaticists, and regulators) about cross-disciplinary challenges and possible solutions.

translated by 谷歌翻译

Interactive Segmentation of Radiance Fields

Rahul Goel , Dhawal Sirikonda , Saurabh Saini , PJ Narayanan

分类：计算机视觉

2022-12-27

Radiance Fields (RF) are popular to represent casually-captured scenes for new view generation and have been used for applications beyond it. Understanding and manipulating scenes represented as RFs have to naturally follow to facilitate mixed reality on personal spaces. Semantic segmentation of objects in the 3D scene is an important step for that. Prior segmentation efforts using feature distillation show promise but don't scale to complex objects with diverse appearance. We present a framework to interactively segment objects with fine structure. Nearest neighbor feature matching identifies high-confidence regions of the objects using distilled features. Bilateral filtering in a joint spatio-semantic space grows the region to recover accurate segmentation. We show state-of-the-art results of segmenting objects from RFs and compositing them to another scene, changing appearance, etc., moving closer to rich scene manipulation and understanding. Project Page: https://rahul-goel.github.io/isrf/

translated by 谷歌翻译